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ABSTRACT 



A method and system thereof for processing data in a 
computer graphics system. More specifically, an anti- 
aliasing buffer architecture for processing fragments in a 
computer graphics system is described. When a new frag- 
ment for a particular pixel location is received, the fragment 
stack for that pixel location is read from fragment memory. 
The new fragment is appended to the fragment stack, and the 
resultant fragment stack is written back to fragment memory 
before it is processed in a computer graphics pipeline. 
Fragments stored in fragment memory are not sorted accord- 
ing to their distance from the view plane (the z-dimension); 
instead, z-ordered depth sorting is performed in the com- 
puter graphics pipeline. Using an occlude command, 
occluded (blocked) fragments can be deleted from the 
fragment stack before the fragment stack is passed to the 
computer graphics pipeline. The computer graphics pipeline 
calculates a pixel color for each pixel location. Multiple 
computer graphics pipelines can be executed in parallel, and 
the pixel colors determined in each pipeline are interleaved 
and stored in a frame buffer. Because fragment stacks are 
read from and written back to fragment memory before 
processing in the computer graphics pipeline, processing is 
not held up waiting for the reads and writes to occur and data 
throughput in the pipeline is increased. Consequently, the 
overall performance of the computer graphics system is 
improved. 

27 Claims, 8 Drawing Sheets 
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(Prior Art) 
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METHOD AND DESIGN FOR IMPROVED 
FRAGMENT PROCESSING 

BACKGROUND OF THE INVENTION 

1. Field of the Invention 

The present invention relates to the field of data process- 
ing in a computer graphics system. More specifically, the 
present invention pertains to an anti-aliasing buffer archi- 
tecture for processing fragments in a computer graphics 
system. 

2. Background Art 

Computer graphics generally consists of instructions 
implemented via a graphics system executed on a computer 
system. The instructions are used to specify the calculations 
and operations needed to produce rendered images that have 
a three-dimensional appearance. 

The computer graphics system can be envisioned, in part, 
as a pipeline through which pixel data pass. The data are 
used to define the image to be produced and displayed. At 
various points along the pipeline, various calculations and 
operations are specified by the graphics designer, and the 
data are modified accordingly. 

In the initial stages of the pipeline, the desired image is 
composed using geometric shapes such as lines and 
polygons, referred to in the art as geometric "primitives." 
The derivation of the vertices for an image and the manipu- 
lation of the vertices to provide animation entail performing 
numerous geometric calculations in order to project the 
three-dimensional world being designed to a position in the 
two-dimensional world (the "view plane") of the display 
screen. 

Primitives are then decomposed into "fragments," and 
these fragments are assigned attributes such as color, 
perspective, and texture. In order to enhance the quality of 
the image, effects such as lighting, fog, and shading are 
added, and anti-aliasing and blending functions are used to 
give the image a smoother and more realistic appearance. In 
the final image generation stage, the fragments and their 
associated attributes are stored in the frame buffer as pixels. 
The pixel values can be later read from the frame buffer, and 
can be used to display images on the computer screen. 

The entire process, from projecting the primitives onto the 
view plane through formation of the output image, is known 
as rendering. The specific process of decomposing indi- 
vidual primitives and determining per-pixel or per-fragment 
values from those geometric primitives is known as raster- 
ization. 

With reference now to Prior Art FIG. 1, process 130 
exemplifies one embodiment of a graphics design process 
implemented using a graphics program on a computer 
system. Process 130 operates on vertex (or geometric) data 
131. The blocks within process 130 consist of display lust 
133, evaluators 134, per-vertex operations and primitive 
assembly 135, rasterization 138, per-fragment operations 
139, and frame buffer 140. 

Vertex data 131 are loaded from the computer system's 
memory and saved in display list 133; however, in some 
graphics programs, a display list is not used and, instead, the 
vertex data are processed immediately. When display list 
133 is executed, evaluators 134 derive the coordinates, or 
vertices, that are used to describe points, lines, polygons, 
and the like (e.g., primitives). All geometric primitives arc 
eventually described by collections of vertices. 

With reference still to Prior Art FIG. 1, in per-vertex 
operations and primitive assembly 135, vertex data 131 are 
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converted into primitives that are assembled to represent the 
surfaces to be graphically displayed. Some vertex data (for 
example, spatial coordinates) are transformed, typically 
using matrix multiplication, to project the spatial coordi- 

5 nates from a position in the three-dimensional world to a 
position on the display screen. 

In addition, advanced features are also performed in 
per-vertex operations and primitive assembly 135. Texturing 
coordinates may be generated and transformed. Lighting 

10 calculations are performed using the transformed vertex, the 
surface normal, material properties, and other lighting infor- 
mation to produce a color value. Perspective division, which 
is used to make distant objects appear smaller than closer 
objects in the display, also occurs in per-vertex operations 

15 and primitive assembly 135. 

Rasterization 138 is the conversion of vertex data into 
"fragments." Each fragment corresponds to a single element 
(e.g., a "pixel") in the graphics display, and typically 
includes data defining color, shading, and texture. Per- 

20 fragment operations 139 consist of additional operations that 
may be enabled to enhance the detail of the fragments, such 
as blending, dithering and other like operations. After 
completion of these operations, the processing of the frag- 
ment is complete and it is written as a pixel to frame buffer 

25 140. 

Part of the process of anti-aliasing is performed during 
rasterization 138. Anti-aliasing is a technique for correcting 
the problem of aliasing, which can cause the edges of an 

3Q object to appear jagged when the object is rendered. For 
example, a polygon may only partially cover a number of 
pixels; that is, the edge of the polygon may pass through a 
number of adjacent pixels. If these pixels are approximated 
as being fully covered by the polygon (and are colored the 

35 color of the polygon), the edge the polygon would likely 
appear as jagged when it is rendered. In addition, the 
fragments that correspond to each screen pixel must be kept 
in a sorted order according to their distance from the view 
plane. 

40 In general, there are two common approaches for imple- 
menting this sorted ordering: the "Z-buffer" approach and 
the "A-buffer" approach. In the Z-buffer approach, pixel 
data including a depth value (e.g., a z-dimension indicating 
distance from a view plane) are stored for every pixel 

45 location in a display image. As geometric primitives are 
rasterized, the depth values for newly generated pixel data 
are compared to depth values for pixel data in the Z-buffer. 
If the newly generated pixel data are closer to the view plane 
(e.g., a smaller value of z), then these data are written over 

50 the current pixel data in the Z-buffer. If the newly generated 
pixel data have a larger value of z, then the new data are 
disregarded. 

The Z-buffer approach will always result in aliasing 
because it does not adequately address partially covered 
55 pixels. The A-buffer approach improves the Z-buffer 
approach by addressing anti-aliasing for partially covered 
pixels. 

In the A-buffer approach, polygons arc clipped into frag- 
ments at the boundaries of a pixel. For example, consider a 

60 square-shaped pixel partially covered by a portion of a 
polygon. A pixel fragment would be generated to represent 
the portion of the polygon covering the pixel. A bit mask 
representing the edges of the polygon is used to describe 
how the polygon partially covers the pixel. When there are 

65 multiple polygons contributing to the color of a pixel, 
multiple pixel fragments are generated, and the colors of the 
fragments are then resolved within the A-buffer to compute 
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a final color for the pixel The multiple fragments corre- 
sponding to a pixel location are stored in memory as a 
"fragment stack." 

Prior Art FIG. 2 is a data flow diagram showing one 
embodiment of an anti-aliasing buffer (A-buffer) architec- 
ture 200 used in a computer graphics system. In A-buffer 
architecture 200, the fragment data flow in a loop from 
fragment memory 210 to fragment manager 220, through 
fragment evaluation pipeline 230a or 2306, then back to 
fragment memory 210. The process may also be performed 
using parallel paths, each path with distinct fragment 
memory, fragment manager, evaluation pipeline(s), and 
result router. 

A new fragment 202 for a particular pixel location causes 
fragment stack 212 for that pixel location to be read from 
fragment memory 210. Fragment manager 220 feeds frag- 
ment stack 212 to one of the fragment evaluation pipelines 
230a or 2306 (it is appreciated that more than two pipelins 
can be used in A-buffer architecture 200). 

After evaluation in the pipeline, the final pixel color for 
the pixel location is determined, and pixel data 246 
(comprising the pixel color and pixel location) are sent to 
frame buffer 140 via evaluation result router 240. Also, 
evaluation result router 240 writes the processed fragment 
stack 235 back to fragment memory 210, so that fragment 
stack 235 can be accessed the next time a new fragment is 
received for the pixel location. Thus, in the prior art, the data 
that are output from pipelines 230a and 2306 are written to 
fragment memory 210, Smaller memory caches are some- 
times used in conjunction with fragment memory 210 to 
speed up access to repetitively used fragment stacks. 

In most implementations, fragment memory 210 uses a 
paging scheme to read and write fragment stacks (e.g., 
fragment stacks 212 and 235, respectively). These imple- 
mentations take into account that the average polygon size 
is fifty pixels or less. As such, the average width of the 
polygon is eight pixels or less. Because of the amount of data 
stored for each fragment, and the need to keep memory 
pages small for quick access to prevent visual artifacts, most 
polygons will cross at least one memory page boundary. 
That is, the page position is fixed but the polygon position 
is not (in an animated image), and therefore it is probable 
that polygons will cross the page boundary. 

Prior Art FIG. 3 illustrates a polygon 310 spanning 
multiple memory pages 320, 322, 324 and 326 in a fragment 
memory 210 used with a computer graphics system. Frag- 
ment memory 210 is aligned with the display screen coor- 
dinates. Accordingly, a fragment stack (exemplified by frag- 
ment stack 315) is associated with a particular pixel location 
on the display screen. 

Each fragment in fragment stack 315 typically is associ- 
ated with the following information that must be stored in 
fragment memory 210: a transparency flag (one bit); a 
transparency value (eight bits); a stencil flag (one bit); a 
stencil value (eight bits); a fragment color (red, green or 
blue, typically at least eight bits each; some systems use ten 
bits each); a fragment mask (typically, at least 16 bits); a 
fragment z-dimension (typically, 24 bits); various stack 
control flags (typically, 6 bits); an offset to the next fragment 
in the stack (at least eight bits); the total number of bits 
required for each fragment in a stack (at least 96 bits, or 12 
bytes); and in some implementations, a header word that 
holds the pixel location, current final color, touched flag, and 
an offset to the first entry in the stack. 

With reference back to Prior Art FIG. 2, A-buffer archi- 
tecture 200 has several disadvantages. One disadvantage to 
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the prior art is associated with the paging scheme used by 
fragment memory 210. Swapping memory pages is time- 
consuming in the context of a processing pipeline (e.g., 
fragment evaluation pipeline 230a) that is trying to keep up 

5 with a high performance graphics system that may be 
producing tens of millions fragments (or more) per second. 
The size of the memory page must be tuned to the graphics 
system in order to optimize performance. If the page size is 
made too small, then memory pages must be swapped more 

10 often. On the other hand, if the page size is made too large, 
then it takes more time to swap them, and this may cause 
visible artifacts in the display. 

Also associated with this disadvantage is the depth 
(number of stages) in fragment evaluation pipelines 230a 

15 and 2306. To gain the necessary data throughput, fragment 
evaluation pipelines 230a and 2306 must include multiple 
stages. Generally, there are at least eight stages to a pipeline: 
pixel ownership test, scissors test, alpha test, stencil test, 
depth test, blend, dither, and logic operations. Each stage of 

20 fragment evaluation pipelines 230a and 2306 will be oper- 
ating on a different fragment stack. Typically, a fragment 
stack moves from one stage in the pipeline to the next stage 
in the pipeline every "N" clock cycles. Each stage in the 
pipeline has a set number, N, of clock cycles to complete its 

25 function. The slowest stage in the pipeline (the stage with the 
maximum N) controls the speed of the pipeline. 

With reference to Prior Art FIGS. 2 and 3, it is common 
for a fragment stack to be in the process of being read from 
one page in fragment memory 210 while another fragment 

30 stack waits to be written to a different page in fragment 
memory 210. That is, fragment stack 212 may be in the 
process of being read from memory page 322; in the 
meantime, fragment stack 235 has been processed in frag- 
ment evaluation pipeline 230a and needs to be written to a 

35 different memory page 320. This causes "memory (or page) 
thrashing" or "page misses," in which one memory page is 
open in order to read data and another memory page needs 
to be opened in order to write data (and vice versa). As a 
result of memory thrashing, processing is delayed until the 

40 necessary read or write can be accomplished. The processing 
delay may be manifested as lag or a loss in detail in the 
rendered image. 

Consequently, another problem with the prior art A-buffer 

45 architecture 200 occurs when the data that are output from 
pipelines 230a and 2306 are written to fragment memory 
210. A different memory page may be open, and thus the 
post-pipeline write operation is delayed until the proper 
memory page can be opened and the processed data (e.g., 

5Q fragment stack 235) written to fragment memory 210. This 
delay (due to memory thrashing) will be propagated back 
through pipelines 230a and 2306, thereby reducing the 
processing efficiency and data throughput of the computer 
graphics system. 

55 Similarly, if data (e.g., fragment stack 212) are to be read 
from one page in fragment memory 210, but a different page 
is open because the post-pipeline write operation is ongoing, 
then the read will be delayed until the proper memory page 
can be opened. As a result, pipelines 230a and 2306 may be 

60 starved for data, with processing delayed until data are 
received. 

In addition, fragment evaluation pipelines are being 
increased in length in order to add stages that provide 
effects, such as shadows, that increase the realism of the 
65 rendered images. Longer pipelines can exacerbate the prob- 
lem of memory thrashing by increasing the likelihood that 
one memory page will be open when another is needed. As 
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described above, increasing the size of the memory page to tors that require fragment stack 212 to be re-sorted prior to 

alleviate this problem will result in processing delays the depth test. These factors can include such things as the 

because of the additional time needed to read (or write) the results of the stencil test, or a change in depth sorting order, 

larger pages. Adding a cache between the end of fragment Because fragment stack 212 is depth-sorted in fragment 

evaluation pipeline 230a and fragment memory 210, or 5 evaluation pipeline 230a, it is desirable to reduce the number 

between fragment memory 210 and the beginning of frag- of operations associated with determining whether fragment 

ment evaluation pipeline 230a, also does not help, because stack 212 is already in sorted order, 
the cache will fill up and thus may contribute to the memory 

thrashing problem (in effect, the cache simply adds another SUMMARY OF THE INVENTION 

stage to the pipeline). 10 Accordingly, what is needed is a method and/or system 

The use of multiple fragment evaluation pipelines can that can be used in a computer graphics system to address 

also aggravate the memory thrashing problem. One pipeline the memory thrashing problem that occurs when reading and 

may be attempting to read a memory page, and another may writing fragment stacks from fragment memory. What is 

be attempting to write to a different page. With multiple also needed is a method and/or system that address the above 

pipelines attempting to access fragment memory 210 at ^ nee d and that can increase the speed at which fragment 

different times, the frequency of memory thrashing will get stacks are read from and written to fragment memory. In 

worse. addition, what is needed is a method and/or system that 

Yet another disadvantage associated with A-buffer archi- address the above needs and that can increase the data 

lecture 200 is that the entire fragment stack (e.g., fragment throughput in a computer graphics system. Furthermore, 

stack 212) is carried all the way through fragment evaluation 20 what is needed is a method and/or system that address the 

pipeline 230a. In some instances, the entire fragment stack above needs and that can reduce the number of operations 

212 is not required. As described above, a significant amount associated with depth-sorting the fragments in a fragment 

of information is associated with each fragment in a frag- stack. What is also needed is a method and/or system that 

ment stack. As a result, the data throughput in A-buffer address the above needs and that can allow status memory 

architecture 200, in particular in fragment evaluation pipe- 25 to operate independently of fragment memory, 

line 230a, is reduced because of the additional time needed The present invention includes a method and system 

to read/write as well as process the data. thereof that satisfy the above needs. These and other advan- 

With reference still to Prior Art FIG. 2, another problem tages of the present invention not specifically mentioned 

associated with A-buffer architecture 200 is that status 3Q above will become clear within discussions of the present 

memory 205 is a part of fragment memory 210. Status invention presented herein. 

memory 205 is assigned to hold a set of three status bits for in one embodiment, a method and system thereof for 

each fragment stack. The status bits include an "exist" bit processing data in a computer graphics system are described, 

that is used to indicate whether the fragment stack currently i n the present embodiment, when a new fragment for a 

holds any fragments, a "valid/invalid" bit to indicate 35 particular pixel location is received, the fragment stack for 

whether the fragment stack at this location is valid or has that pixel location is read from fragment memory. The new 

been invalidated (used to erase areas of A-buffer memory), fragment is appended to the fragment stack, and the resultant 

and a "size" bit to indicate whether the fragment stack has fragment stack is written back to fragment memory before it 

reached the defined maximum number of fragments per is processed in a computer graphics pipeline. Thus, in 

stack. For a typical 1280 by 1024 pixel screen, the three 4Q accordance with the present invention, the write back to 

status bits can require almost one-half megabyte of memory. fragment memory after processing in the computer graphics 

Because status memory 205 is within fragment memory pipeline is eliminated. 
210, it needs to be accessed with the same memory paging i n accordance with the present invention, fragments 
mechanism. This means that A-buffer architecture 200 stored in fragment memory are not sorted according to their 
requires a caching scheme to hold the status memory page 45 distance from the view plane (the z-dimension); instead, 
while the fragment memory page is being worked on (e.g., z-ordered depth sorting is performed in the computer graph- 
read). However, the amount of space available on an inte- ics pipeline. Using an occlude command, occluded 
grated circuit is limited, and therefore the space that can be (blocked) fragments can be deleted from the fragment stack 
used for the caching scheme is also limited. In addition, as before the fragment stack is passed to the computer graphics 
fragment evaluation pipelines grow in length and 50 pipeline. The computer graphics pipeline calculates a pixel 
complexity, there may be more effective uses of the space color for each pixel location. Multiple computer graphics 
assigned to the caching scheme. pipelines can be executed in parallel, and the pixel colors 

An additional disadvantage associated with the prior art is determined in each pipeline can be interleaved to improve 

that fragments in a fragment stack are depth-sorted (by processing efficiency. 

ZKlimcnsion) in fragment memory 210. Prior art systems 55 in one embodiment, an anti-aliasing buffer architecture 
such as A-buffer architecture 200 rely on storing the frag- for processing fragments in a computer graphics system is 
ment stack in sorted order to more easily identify where to described. The anti-aliasing buffer architecture comprises a 
insert a new fragment 202 into a fragment stack 212. fragment memory, one or more fragment evaluation 
Depth-sorting always increases the number of operations pipelines, and a frame buffer. In one embodiment, the 
that need to be performed on the fragment stack 212, That anti-aliasing buffer architecture further comprises a frag- 
is, this technique makes it unnecessary to perform opera- ment memory manager and a fragment evaluation pipeline 
tions to determine that the fragment stack 212 is in sorted manager. In this embodiment, the fragment evaluation pipe- 
order, and to perform operations to sort the fragment stack. line manager receives a new fragment for a particular pixel 
However, the mechanism to complete re-sort the fragment location and writes the new fragment to the fragment 
stack must still be available. 65 memory manager. The fragment memory manager reads the 
'typically, fragment stack 212 is depth-sorted in fragment fragment stack for that pixel location and appends the new 
evaluation pipeline 230a. There are, however, several fac- fragment to the fragment stack. In accordance with the 
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present invention, the fragment stack (including the new 
fragment) is written back to fragment memory before further 
processing in a fragment evaluation pipeline. Thus, the 
fragment stack can be written to its memory page in frag- 
ment memory while that page is still open (that is, before a 
page swap occurs), and therefore page thrashing can be 
prevented. 

In the present embodiment, the fragment memory man- 
ager also sends the fragment stack (including the new 
fragment) to the fragment evaluation pipeline manager, and 
in turn the fragment stack is processed in a fragment 
evaluation pipeline to determine a pixel color for the pixel 
location being evaluated. The pixel color is then stored in the 
frame buffer; when multiple fragment evaluation pipelines 
are being used, the pixel colors determined in each pipeline 
are interleaved to improve efficiency. Significantly, in accor- 
dance with the present invention, fragment stacks are not 
written to fragment memory after processing in the fragment 
evaluation pipeline(s). 

In the present embodiment, each fragment evaluation 
pipeline performs z-ordered depth sorting of the fragment 
stack as a pipeline stage (where the z-dimension indicates 
distance to a view plane). Thus, in accordance with the 
present invention, fragments can be stored in fragment 
memory in arbitrary z-dimension order within a fragment 
stack (that is, fragments in a fragment stack do not need to 
be sorted according to the z-dimension before the fragment 
stack is stored in fragment memory). Therefore, the opera- 
tions associated with depth-sorting the fragments in a frag- 
ment stack is relegated to one or more pipeline stages and do 
not affect overall throughput. 

In the present embodiment, when a fragment evaluation 
pipeline determines that a fragment is blocked from view 
(and thus will not contribute to the pixel color at a pixel 
location), an occlude command is sent to the fragment 
memory manager. The fragment memory manager sets an 
occlude status bit for that pixel location. In one embodiment, 
when the fragment stack for the pixel location is next read 
from fragment memory, the fragment memory manager 
deletes the occluded fragments from the fragment stack. 
Thus, when the occlude status bit is set, the fragment stack 
is "trimmed;" therefore, only visible fragments are for- 
warded to the fragment evaluation pipeline(s). Deleting 
occluded fragments from a fragment stack can increase 
throughput through an anti-aliasing buffer architecture. 
Making a fragment stack shorter means that it can be more 
quickly read from and written to fragment memory. 

In one embodiment, the anti-aliasing buffer architecture 
further comprises a status memory for storing an additional 
bit — the occlude status bit — for each of the pixel locations. 
In this embodiment, the status memory is placed on a 
separate memory bus or in a memory bank separate from 
fragment stack memory. This allows the status memory 
(specifically, a status memory page) to operate indepen- 
dently of a fragment memory page without affecting access 
to fragment data. 

In summary, in accordance with the present invention, 
fragment stacks arc read from and written back to fragment 
memory before processing in the computer graphics 
pipeline(s). It is not necessary for the entire fragment stack 
to be passed all the way through a fragment evaluation 
pipeline before it is written back to fragment memory. 
Fragment stacks being processed in the fragment evaluation 
pipeline(s) are not held up waiting for a memory page swap 
to write back to fragment memory, and vice versa. Also, 
pixel writes to the frame buffer are not held up because of 
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page misses in fragment memory. As a result, processing 
performance in the fragment evaluation pipeline(s) can be 
improved and data throughput in the pipeline(s) can be 
increased, improving the overall performance of a computer 
5 graphics system. 

DESCRIPTION OF THE DRAWINGS 

The accompanying drawings, which are incorporated in 
and form a part of this specification, illustrate embodiments 
10 of the invention and, together with the description, serve to 
explain the principles of the invention: 

Prior Art FIG. 1 is a block diagram showing the stages in 
one embodiment of a computer graphics pipeline executed 
J5 by a computer graphics system. 

Prior Art FIG. 2 is a data flow diagram showing one 
embodiment of an anti-aliasing buffer architecture used in a 
computer graphics system. 

Prior Art FIG. 3 illustrates a polygon spanning multiple 
20 memory pages in a fragment memory used with a computer 
graphics system. 

FIG. 4 is a block diagram of an exemplary computer 
system upon which embodiments of the present invention 
may be practiced. 
25 FIG. 5 is a data flow diagram showing one embodiment 
of an anti-aliasing buffer architecture used in a computer 
graphics system in accordance with the present invention, 
FIG. 6 illustrates the interleaving of pixels/fragment 
3Q stacks in a computer graphics system in accordance with one 
embodiment of the present invention. 

FIG. 7 is a flowchart of the steps in a process for 
processing fragments/fragment stacks in a computer graph- 
ics system in accordance with one embodiment of the 
35 present invention. 

DETAILED DESCRIPTION OF THE 
INVENTION 

In the following detailed description of the present 

40 invention, a method and design for improved fragment 
processing, numerous specific details are set forth in order to 
provide a thorough understanding of the present invention. 
However, it will be recognized by one skilled in the art that 
the present invention may be practiced without these specific 

45 details or with equivalents thereof. In other instances, well 
known methods, procedures, components, and circuits have 
not been described in detail as not to unnecessarily obscure 
aspects of the present invention. 
Some portions of the detailed descriptions which follow 

50 are presented in terms of procedures, logic blocks, 
processing, and other symbolic representations of operations 
on data bits within an intelligent electronic media device. 
These descriptions and representations are the means used 
by those skilled in the data processing arts to most effec- 

55 tively convey the substance of their work to others skilled in 
the art. A procedure, logic block, process, etc., is herein, and 
generally, conceived to be a self-consistent sequence of steps 
or instructions leading to a desired result. The steps arc those 
requiring physical manipulations of physical quantities. 

60 Usually, though not necessarily, these physical manipula- 
tions take the form of electrical or magnetic signals capable 
of being stored, transferred, combined, compared, and oth- 
erwise manipulated in a consumer electronic media device. 
For reasons of convenience, and with reference to common 

65 usage, these signals are referred to as bits, values, elements, 
symbols, characters, terms, numbers, or the like with refer- 
ence to the present invention. 
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It should be borne in mind, however, that all of these a computer graphics subsystem 411 (FIG. 4). In the present 
terms are to be interpreted as referencing physical manipu- embodiment, A-buffer architecture 500 (specifically, frag- 
lations and quantities and are merely convenient labels and ment evaluation pipeline manager 520) responds to control 
are to be interpreted further in view of terms commonly used commands 570 from processor 402 (FIG. 4). Pipeline man- 
in the art. Unless specifically stated otherwise as apparent 5 ager 520 communicates with fragment memory manager 
from the following discussions, it is understood that 560 via fragment control commands 571, and with fragment 
throughout discussions of the present invention, discussions evaluation pipelines 530a and 530Z? via evaluation control 
utilizing terms such as "receiving," "reading," "appending," commands 572. 

"writing," "deleting," "sending," "processing," "storing," . , , A , _ , . mM . 

"interleaving," "identifying," "setting," "performing" or the 1fl J n the discussion below, A-buffer architecture 500 is 

like, refer to the action and processes (e.g., process 700 of 10 discussed in the context of two parallel fragment evaluation 

FIG. 7) of an electronic device such as a microcontroller or pipelines supported by fragment memory 510. However, it 

similar electronic computing device (e.g., dedicated or & appreciated that more than two pipelines can be imple- 

embedded computer system) that manipulates and trans- mented in parallel, and that each pipeline may or may not 

forms data. The data are represented as physical (electronic) have its own fragment memory. 

quantities within the electronic device* s registers and memo- 15 With reference to FIG. 5, in accordance with the present 
ries and is transformed into other data similarly represented invention, fragment stacks (e.g., fragment stack 562) are 
as physical quantities within the electronic device memories read from f ragment memory 510 at fragment memory man- 
or registers or other such information storage, transmission, ager 560 . Significantly, fragment stacks (e.g., fragment stack 
or display screens. 564 ) are also yniUen t0 fragment memory 510 at fragment 
FIG. 4 is a block diagram of graphics computer system 20 memory man ager 560, before fragment stack 564 is evalu- 
400 upon which embodiments of the present invention can ated in a fragment evaluation pipeline (e.g., fragment evalu- 
be implemented Computer system 400 executes a atioD p ip eline 530a or 5306). Therefore, it is not necessary 
computer-controlled graphics program for generating com- for the ^ fr ^ {o be aU ^ t 

plex or three-dimensional images. c t . ... , r c . ' . . , 

r „ Ann & . , a fragment evaluation pipeline before it is written back to 

Computer system 400 comprises a bus or other commu- 25 °" , m ^ m „„, A „ u*> ™ n ,u 10 ^ n nr „ 

. „. r ; , 1Af • »• • r j fragment memory. As will be seen, tnis can improve pro- 

nication means 410 for communicating information, and a & . - ■ c t , ■ % - « n 

An - 1 1 ... . Aint • ■ c cessing performance in fragment evaluation pipelines 530a 

processor 402 coupled with bus 410 for processing infor- , . °* B v y 

mation. Computer system 400 further comprises a random an 

access memory (RAM) or other dynamic storage device 401 In accordance with the present embodiment of the present 

(main memory 401) coupled to bus 410 for storing infor- 30 invention, status memory 505 is in a memory bank separate 

mation and instructions to be executed by processor 402. fro™ fragment memory 510. Alternatively, status memory 

Main memory 401 also may be used for storing temporary 505 can be placed on a separate bus. Sufficient memory 

variables or other intermediate information during execution bandwidth is available for access to the fragment stack data, 

of instructions by processor 402. Data storage device 407 is By placing status memory 505 on a separate memory bus or 

coupled to bus 410 and is used for storing information and 35 in a separate memory bank, status memory 505 (specifically, 

instructions. Furthermore, signal input/output (I/O) commu- a status memory page) can operate independently of a 

nication device 408 is used to couple computer system 400 fragment memory page without affecting access to fragment 

onto, for example, a network. data. In the present embodiment, status memory 505 is 

Computer system 400 can also be coupled via bus 410 to expanded to include an additional bit— the occlude status 

an alphanumeric input device 422, including alphanumeric 40 bit— per pixel location, in order to flag whether the fragment 

and other keys, which is used for communicating informa- stack for a pixel location needs to be trimmed due to 

tion and command selections to processor 402. Another type occlusion. 

of user input device is mouse 423 (or a like device such as In the present embodiment, a new fragment 502 for a 

a trackball or cursor direction keys) which is used for particular pixel location is received by fragment evaluation 

communicating direction information and command selec- 45 pipeline manager 520. Pipeline manager 520 sends new 

tions to processor 402 and for controlling cursor movement fragment 502 to fragment memory manager 560. Fragment 

on display device 421. This input device typically has two memory manager 560 first checks status memory 505 to see 

degrees of freedom in two axes, a first axis (e.g., x) and a if a fragment stack exists yet at that pixel location. If no 

second axis (e.g., y), which allows the device to specify fragment stack exists for a pixel location, only the new 

positions in a plane. 50 fragment is written to fragment memory 510 and then passed 

In accordance with the present invention, also coupled to t0 fragment evaluation pipeline 530a or 5306. The status bits 

bus 410 is graphics subsystem 411. Processor 402 provides for that pixel location are updated to indicate that a fragment 

graphics subsystem 411 with graphics data such as drawing stack now exists. 

commands, coordinate vertex data, and other data related to If a fragment stack exists, fragment memory manager 560 

an object's geometric position, color, and surface param- 55 reads fragment stack 562, corresponding to the particular 

eters. In general, graphics subsystem 411 processes the pixel location, from fragment memory 510 (fragment stack 

graphical data, converts the graphical data into a screen memory). In the present embodiment, if the occlude status 

coordinate system, generates pixel data (e.g., color, shading, bit is not set, fragment memory manager 560 appends (adds) 

texture) based on the primitives (e.g., points, lines, and new fragment 502 to fragment stack 562, making fragment 

polygons including triangles and quadrangles, as well as 60 stack 564; that is, in this embodiment, fragment stack 564 

polygon meshes, strips and the like), and performs blending, comprises the fragments from fragment stack 562 and new 

anti-aliasing, and other functions. The resulting data are fragment 502. Fragment stack 564 is then written back to 

stored in frame buffer 430. A display subsystem (not shown) fragment memory 510 and then to either fragment evalua- 

rcads frame buffer 430 and displays the image on display tion pipeline 530a or 530/? via pipeline manager 520. 

device 421. 65 Thus, in accordance with the present invention, fragment 

FIG. 5 is a data flow diagram showing one embodiment slack 562 is read from a particular memory page in fragment 

of an anti-aliasing buffer (A-buffer) architecture 500 used in memory 510, and fragment stack 564 (including new frag- 
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ment 502) is written to fragment memory 510 soon enough 
thereafter that the same memory page is still open. As such, 
memory page thrashing will not occur. That is, for a frag- 
ment stack for a particular pixel location, there is no need for 
a memory swap to occur in order to perform a read trans- 5 
action at one time, and then to perform a write transaction 
at a later time. 

Thus, fragment evaluation pipelines 530a and 5306 are 
not held up at the front end (or top) waiting for the fragment 
stacks that are to be processed. In addition, fragment evalu- 10 
ation pipelines 530a and 5306 are not held up at the back end 
(or bottom) waiting for fragment stacks to be written to 
fragment memory. Consequently, processing performance 
and data throughput in graphics subsystem 411 (FIG. 4) are 
improved. 15 

With reference still to FIG. 5, in the present embodiment, 
in those instances in which the fragment closest to the view 
plane in fragment stack 562 blocks the visibility of the rest 
of the stack, the fragment stack is considered occluded and 
fragment memory manager 560 will receive an occlude 20 
command 570 from one of the fragment evaluation pipelines 
530a or 5306. Occlude command 570 identifies the pixel 
location of a fragment stack that should be "trimmed" (that 
is, occluded fragments can be deleted from the fragment 
stack identified by occlude command 570). In the present 25 
embodiment, in response to occlude command 570, frag- 
ment memory manager 560 sets the occlude status bit (e.g., 
status data 506a) in status memory 505. The status bits in 
status memory 505 correspond to the same pixel location 
that is associated with the fragment stack to be trimmed 30 
(e.g., fragment stack 562). An occlude status bit exists for 
each pixel location and thus for each fragment stack in 
fragment memory 510. 

In the present embodiment, fragment stack 512 may be 35 
trimmed to two fragments if the occlude status bit (e.g., 
status data 5066) is set. In this case, fragment stack 512 will 
comprise the closest visible fragment from fragment stack 
564 and the new fragment 502. These two fragments are 
stored in fragment memory 510 and passed to an evaluation 4Q 
pipeline 530a or 5306. 

In one embodiment, when fragment stack 562 is read from 
fragment memory 510, fragment memory manager 560 will 
read the occlude status bit 5066 for fragment stack 562. If 
the occlude status bit is set, only the top- most visible 45 
fragment and the new fragment are passed from fragment 
memory manager 560 to pipeline manager 520 in fragment 
stack 512. Only the top-most fragment and the new fragment 
502 are written back to fragment memory 510. If the occlude 
status bit is not set, then stack trimming is not performed and 50 
all fragments, including the new fragment 502, are passed 
from fragment memory manager 560 to pipeline manager 
520 in fragment stack 512. Once the occlude status bit 5066 
has been read, the status bit is turned off (that is, it is no 
longer set, until a subsequent occlude command is received 55 
for that pixel location). 

It is appreciated that, in another embodiment, occluded 
fragments can be deleted from fragment stack 562 as it is 
read from fragment memory 510. In this case, fragment 
stack 512 would include the closest visible fragment from so 
fragment stack 562 and new fragment 502. In one 
embodiment, occluded fragments are deleted from fragment 
stack 562 on the write to fragment memory 510. 

Basically, there are three kinds of pixels in a rendered 
scene or image: background pixels, edge/transparent pixels, 65 
and interior pixels. Background pixels are at locations in the 
image where no polygons exist, and so the background color 
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(or a background image) is used instead. There is no 
fragment stack for background pixels. 

Edge/transparent pixels are pixels involved in showing an 
edge of at least one polygon, or are part of a visible, partially 
transparent polygon. The fragment stack for edge/ 
transparent pixels is not involved with an occlude command 
570, and the entire fragment stack is retained until fragment 
memory 510 is cleared for a new image. An additional 
trimming operation occurs if the number of fragments in the 
fragment stack exceeds the allowed number. In this case, the 
fragments are written back to fragment memory 510 in 
sorted order (according to the value of z, where z indicates 
the distance to the view plane). The color and z values for 
the last two fragments are averaged, and their fragment 
masks are ORed. The resultant fragment represents the last 
two fragments and replaces them in the stack. 

Interior pixels are pixels that exist in the interior of a 
polygon. This type of pixel is generally the most numerous 
in a rendered image. In the present embodiment, occlude 
commands 570 are only applied to non-transparent interior 
pixels. 

In the present embodiment, fragment memory manager 
560 performs stack trimming by filtering fragments based on 
the current mode of z-dimension depth-sorting implemented 
in A-buffer architecture 500. Occlude command 570 is only 
applied to those cases where there are no edges or transpar- 
ency in the topmost fragment. Evaluation pipeline 530a or 
5306 detects the case of an interior fragment occluding all 
fragments underneath it. This situation will result in the 
assertion of occlude command 570, which causes all the 
fragments except the fragment closest to the view plane to 
be removed from the fragment stack. 

In the present embodiment, when fragment memory 560 
sees the occlude status bit 5066 is set for a fragment stack, 
the fragment stack is written to fragment memory 510 as 
follows: the first fragment is written to fragment memory 
510, and the next fragment is checked to see if its 
z-dimension is less than the previous one. If the z-dimension 
is less, then the latter fragment replaces the preceding one in 
fragment memory 510. This process continues until all of the 
fragments in the fragment stack have been compared. The 
new incoming fragment is written to fragment memory 510 
as well. The new stack size is two fragments. 

For example, if the topmost fragment (the fragment with 
a smaller value for the z-dimension) completely covers a 
pixel location and is not transparent, then lower fragments 
(having higher values of z) will not be visible and thus can 
be deleted from the fragment stack. Another case is one in 
which two or more fragments with equal z values form a 
seam between adjoining polygons. In this case, if these 
fragments completely cover the area at the pixel location 
where they meet, then fragments behind them will be 
occluded and can be deleted. In one implementation, frag- 
ments with equal values for the z-dimension may be merged 
into a single fragment depending on a register setting 
selected by the system designer. 

Consequently, if the occlude status bit 5066 is set, frag- 
ment stack 512 will comprise fewer fragments than either 
fragment stack 562 or fragment stack 564. Deleting 
occluded fragments from a fragment stack means that the 
fragment stack can be more quickly read from and written to 
fragment memory 510, and more quickly processed in 
fragment evaluation pipeline 530a. Thus, stack trimming 
will increase data throughput in A-buffer architecture 500. In 
addition, a fragment stack that has been trimmed saves space 
in fragment memory 510. 
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In the present embodiment, fragment stack 512 is sent status bit flags for the pixel location are read from status 

from fragment memory manager 560 to pipeline manager memory 505 to determine if a fragment stack exists yet at 

520. Fragment stack 512 is then fed into one of the fragment that pixel location. 

evaluation pipelines (e.g., 530a) by pipeline manager 520. In i n s tep 720 of FIG. 7, and with reference also to FIG. 5, 

one embodiment, fragment stack 512 is buffered in a frag- 5 t h e "exist" and "valid/invalid" bits in status memory 505 are 

ment stack cache 522 before entering fragment evaluation checked for the particular pixel location to determine if a 

pipeline 530a. fragment stack already exists at that location, and if the 

Continuing with reference to FIG. 5, fragment stack 512 fragment stack is valid. If a valid fragment stack exists at the 

is foraarded to one of the fragment evaluation pipelines pixel location, then process 700 proceeds to step 740; 

530a £>rJ5306. In the present embodiment, fragment evalu- 10 otherwise, process 700 proceeds to step 730. 

ation >lp eli nes 530a and 5306 perform z-ordered depth In step 730, with reference also to FIG. 5, a status bit (the 

sorting of the fragments in fragment stack 512. Thus, in valid/invalid stack bit) is set to "enabled." If a fragment 

accordance with the present invention, fragments can be slack exists at the particular pixel location but it is invalid, 

stored in fragment memory 510 in random (arbitrary) the existing stack is deleted from fragment memory 510. 

z-dimension order within a fragment stack (that is, for is New fragment 502 is written to fragment memory 510 at that 

example, fragments in fragment stack 564 do not need to be location and the status exist bit is set to "enabled." 

sorted according to the z-dimension before the fragment , n ?4Q of nG ^ feference ^ lQ mQ tfae 

stack is stored in tragment memory 51U). As described ^ n{ ^ ( fr ^ 

562) is read from 

above, fragment memory manager 560 can filter fragments r ? -A & & ' 

u a Jt ♦ i e * .u ■ • 1 ™ fragment memory 510. 

based on the current mode of z-orderea depth-sorting imple- 20 0 

mented in A-buffer architecture^ 7 Thus, the operations In ste P 745 > with reference as well to FIG. 5, fragment 

associated with depth-sorting the fragments in a fragment memory manager 560 checks the occlude status bit 5066 for 

stack are relegated to a series of pipeline stages, and do not the i current P*? 1 ion in status memor y 505 If lhe 

affect overall throughput. occlude status bit 5066 is set, then process 700 proceeds to 

r . , t . 1 -nnt r> 4 . t step 780; otherwise, process 700 proceeds to step 750, 

Fragment evaluation pipelines 530a and 5306 function to Z3 r ' r r r 

determine the final pixel color for each pixel location. After In ste P 750 of FIG - 7 ' a S ain with reference to FIG. 5, the 
evaluation in the pipeline, the final pixel color (e.g., final " size " blt in status memory 505 is checked to determine 
pixel 535a) for each pixel location is buffered in pixel-out whether the size of the fragment stack 562 is larger than the 
FIFO (first-in, first-out) 540. Pixels 546 from pixel-out FIFO allowed maximum size (e.g., the number of fragments in the 
540 are stored in frame buffer 430. A display subsystem (not 30 stack is & caieT lhaD the maximum number allowed). If not, 
shown) reads the data from frame buffer 430 and renders an P rocess 700 Proceeds to step 770; if so, process 700 pro- 
image that is displayed on display device 421 (FIG. 4). ceeds t0 ste P 760 * 

In accordance with the present invention, a fragment stack In ste P 760 > with reference also to FIG. 5, if the number 

is not written back to fragment memory 510 after it has been of fragments in the fragment stack 562 exceeds the maxi- 

processed in fragment evaluation pipeline 530a or 530b. 35 mum number, t hen the two fragments furthest from the vi ew 

Accordingly, pixel writes to frame buffer 430 are not held up plane are merged . 

because of page misses in fragment memory 510. In step 770, with reference also to FIG. 5, new fragment 

It is appreciated that, in accordance with the present 502 is added to fragment stack 562, and fragment stack 564 

invention, more than two fragment evaluation pipelines can including new fragment 502 is written to fragment memory 

be used in A-buffer architecture 500. Because memory 510 If the number of fragments in fragment stack 564 is at 

thrashing is not an issue, the processing workload and data tne maximum allowable value (from step 760), then status 

throughput can be increased by adding parallel fragment memory 505 is updated to reflect this, 

evaluation pipelines to expand A-buffer architecture 500. In step 780 of FIG. 7, and with reference to FIG. 5, if the 

Resultant pixel colors from each pipeline are interleaved as 45 occlude status bit 5066 is set, only the top-most fragmen t 

described in conjunction with FIG. 6. (e.g., closest to the view plane) from fragment stack 5?T 

FIG. 6 illustrates the interleaving of pixels/fragment (comprising fragment stack 562 and new fragment 502) is 

stacks in a computer graphics subsystem 411 (FIG. 4) in written back to fragment memory 510. 

accordance with one embodiment of the present invention. In accordance with the present invention, fragment stack 

In this embodiment, four fragment evaluation pipelines 50 564 is written back to fragment memory 510 (FIG. 5) before 

(exemplified by fragment evaluation pipelines 530a and it is processed in fragment evaluation pipeline 530a or 5306. 

5306 of FIG. 5) are executed in parallel. The pixels/ Advantageously, fragment stack 564 is written to fragment 

fragments (exemplified by 535a and 5356) are interleaved in memory 510 soon enough after fragment stack 562 was read 

frame buffer 430 (FIG. 4) in a pattern 600; however, it is that a memory swap is not needed to complete both the read 

appre ciated that there are many other possible patterns. 5S and write transactions for the particular pixel location. As 

FIG. 7 is a flowchart of the steps in a process 700 for such, memory thrashing will not occur, and so processing 

processing fragments/fragment stacks in a computer graph- performance and data throughput are improved, 

ics system 411 (FIG. 4) in accordance with one embodiment In addition, because fragment stack 564 is written to 

of the present invention. In the present embodiment, process fragment memory 510 before processing to determine a 

700 is implemented using A-buffer architecture 500 of FIG. 60 pixel color is performed, it is not necessary to store in 

5. fragment memory 510 the header information pertaining to 

With reference to FIG. 7 and also to FIG. 5, in step 710, lhe final pixel color for each pixel location. Consequently, 

a new fragment 502 for a particular pixel location is fragment stack 564 requires less space in memory, and 

received. In the present embodiment, new fragment 502 is subsequent reads of fragment stack 564 can be performed 

received by fragment evaluation pipeline manager 520. In 65 more quickly. 

this embodiment, new fragment 502 is forwarded by pipe- In the present embodiment, depth sorting is performed by 

line manager 520 to fragment memory manager 560. The fragment evaluation pipelines 530a and 5306 (see step 810, 
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below), and therefore fragments in a fragment stack can be 
stored in random (arbitrary) z-dimension order in fragment 
memory 510. Thus, it is not necessary to perform operations 
to put fragment stack 564, for example, into a sorted order 
based on the value of z. Because z-ordered depth sorting is 5 
done as a series of evaluation pipeline stages, it does not 
affect system throughput. 

In step 790 of FIG. 7, again with reference as well to FIG. 
5, fragment stack 512 is sent to pipeline manager 520. 

In step 800 of FIG. 7, with reference also to FIG. 5, 10 
fragment stack 512 is sent to fragment evaluation pipeline 
530a or 5306. In one embodiment, fragment stack 512 can 
be buffered in fragment stack cache 522 before proceeding 
to fragment evaluation pipeline 530a or 5306. 

In step 810 of FIG. 7, and with reference to FIG. 5, 35 
fragment stack 512 is evaluated in fragment evaluation 
pipeline 530a or 5306. In the present embodiment, fragment 
evaluation pipelines 530a and 5306 perform z-ordered depth 
sorting on the fragments in fragment stack 512. Fragment 
evaluation pipelines 530a and 530 function to determine a 2Q 
final pixel color for each pixel location (and fragment stack) 
being evaluated. 

In step 820 of FIG. 7, with reference also to FIG. 5, in the 
present embodiment, fragment evaluation pipeline 530a or 
5306 identifies whether there are any occluded fragments in 25 
fragment stack 512. If so, fragment evaluation pipeline 530a 
or 5306 sends an occlude command 570 to fragment 
memory manager 560. In response to the occlude command 
570, fragment memory manager 560 sets an occlude status 
bit 506a in status memory 505. Occlude status bit 506a 30 
corresponds to the same pixel location as fragment stack 
512. 

In step 830 of FIG. 7, with reference to FIG. 5 also, pixels 
546 are stored in frame buffer 430. In the present 
embodiment, final pixels 535a and 5356 are sent from 35 
fragment evaluation pipelines 530a and 5306, respectively, 
to pixel-out FIFO 540. Final pixels 535a and 5356 are 
interleaved as described in conjunction with FIG. 6 and 
stored in frame buffer 430. In accordance with the present 
invention, a fragment stack is not written back to fragment 40 
memory 510 after it has been processed in fragment evalu- 
ation pipeline 530a or 5306. Accordingly, pixel writes to 
frame buffer 430 are not held up because of page misses in 
fragment memory 510, improving processing performance 
and data throughput in computer graphics subsystem 411 45 
(FIG, 4). 

In summary, the present invention provides a method and 
system that can resolves the memory thrashing problem that 
occurs when reading and writing fragment stacks from 
fragment memory. The present invention also provides a 50 
method and system that can increase the speed at which 
fragment stacks are read from and written to fragment 
memory. In addition, the present invention provides a 
method and system that can increase the data throughput in 
a computer graphics system. The present invention also 55 
provides a method and system that can allow status memory 
to operate independently of fragment memory. 

The preferred embodiment of the present invention, a 
method and design for improved fragment processing, is 
thus described. While the present invention has been 60 
described in particular embodiments, it should be appreci- 
ated that the present invention should not be construed as 
limited by such embodiments, but rather construed accord- 
ing to the below claims. 

What is claimed is: 65 

1. A method for processing data in a computer graphics 
system, said method comprising the steps of: 



a) receiving a fragment for a pixel location; 

b) reading from fragment memory a first fragment stack, 
wherein said first fragment stack corresponds to said 
pixel location; 

c) appending said fragment to said first fragment stack to 
make a second fragment stack; and 

d) writing said second fragment stack to said fragment 
memory prior to sending said second fragment stack to 
a computer graphics pipeline. 

2. The method as recited in claim 1 wherein said step b) 
comprises the step of: 

bl) deleting an occluded fragment from said first frag- 
ment stack. 

3. The method as recited in claim 1 wherein fragments in 
a fragment stack are stored in arbitrary z-dimension order, 
wherein said z-dimension indicates distance to a view plane. 

4. The method as recited in claim 1 further comprising the 
steps of: 

e) sending said second fragment stack to said computer 
graphics pipeline; 

f) processing said second fragment stack in said computer 
graphics pipeline, wherein said processing determines 
a pixel color for said pixel location; and 

g) storing said pixel color in a frame buffer memory. 

5. The method as recited in claim 4 wherein said step e) 
comprises the step of: 

el) deleting an occluded fragment from said second 
fragment stack. 

6. The method as recited in claim 4 wherein said step f) 
comprises the step of: 

fl) performing z-ordered depth sorting of fragments in 
said second fragment stack. 

7. The method as recited in claim 4 wherein said step g) 
comprises the step of: 

gl) interleaving pixel colors determined from a plurality 
of computer graphics pipelines executing in parallel. 

8. The method as recited in claim 4 further comprising the 
steps of: 

hi) identifying a fragment stack comprising an occluded 
fragment; 

h2) setting an occlude status bit for said fragment stack; 

h3) sending only a visible fragment closest to a view plane 
and said fragment from said fragment stack to said 
computer graphics pipeline when said occlude status bit 
is set; and 

h4) writing said visible fragment closest to said view 
plane and said fragment back to said fragment memory. 

9. The method as recited in claim 8 wherein said status bit 
is set in status memory, wherein said status memory is 
separate from said fragment memory. 

10. A computer system comprising: 
a bus; 

a fragment memory coupled to said bus; 

a frame buffer coupled to said bus; and 

a processor coupled to said bus, said processor for per- 
forming a method for processing data in a computer 
graphics subsystem, said method comprising the steps 
of: 

a) receiving a fragment for a pixel location; 

b) reading from said fragment memory a first fragment 
stack, wherein said first fragment stack corresponds 
to said pixel location; 

c) appending said fragment to said first fragment stack 
to make a second fragment stack; and 
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d) writing said second fragment stack to said fragment process fragment stacks and determine pixel colors for 

memory prior to sending said second fragment stack pixel locations; 

to a computer graphics pipeline. a frame buffer operable to store said pixel colors deter- 

11. The computer system of claim 10 wherein said step b) mined by said fragment evaluation pipeline; and 

of said method further comprises the steps of: 5 circuitry for supplying said fragment stack including a 

bl) deleting an occluded fragment from said first frag- new fragment to said fragment memory prior to pro- 

ment stack. cessing said fragment stack by said fragment evaluation 

12. The computer system of claim 10 wherein fragments pipeline. 

in a fragment stack are stored in arbitrary z-dimension order, 20. The anti-aliasing buffer architecture of claim 19 

wherein said z-dimension indicates distance to a view plane. 10 wherein fragments in said fragment stack are stored in 

13. The computer system of claim 10 wherein said arbitrary z-dimension order, wherein said z-dimension indi- 
method further comprises the step of: cates distance to a view plane. 

e) sending said second fragment stack to said computer 21. The anti-aliasing buffer architecture of claim 20 
graphics pipeline* wherein said fragment evaluation pipeline is operable to 

f) processing said se'cond fragment stack in said computer 15 performz-ordered depth sorting of said fragment slack, 
graphics pipeline, wherein said processing determines 22 ' ™° . buffer architecture of claim 19 
a pixel color for said pixel location; and comprising. 

* , . a fragment memory manager coupled to said fragment 

g) storing said pixel color in said frame buffer memory. ° . , c J * r rr t 
+ J A r~ . ri-i-»L • \ memory, said fragment memory manager operable to 

14. The computer system of claim 13 wherein said step e) 7n j i . t i ic *c j c » * i 
*._,./ . r delete an occluded fragment from said fragment stack 

of said method comprises the step of: * * i j j 

r r in response to an occlude command, 

el) deleting an occluded fragment from said second 23. The anti-aliasing buffer architecture of claim 22 

fragment stack, wherein an occlude status bit for said fragment stack is set 

15. The computer system of claim 13 wherein said step f) in response to said occlude command, wherein only a visible 
of said method comprises the step of: 25 f ragrne nt closest to a view plane and said new fragment are 

fl) performing z-ordered depth sorting of fragments in sent from said fragment stack to said computer graphics 

said second fragment stack. pipeline when said occlude status bit is set. 

16. The computer system of claim 13 wherein said step g) 24. The anti-aliasing buffer architecture of claim 23 
of said method comprises the step of: further comprising: 

gl) interleaving pixel colors determined from a plurality 30 a status memory coupled to said fragment memory 

of computer graphics pipelines executing in parallel. manager, said status memory operable to store an 

17. The computer system of claim 13 wherein said occlude status bit for each pixel location. 

method further comprises the steps of: 25. The anti-aliasing buffer architecture of claim 22 

hi) identifying a fragment stack comprising an occluded wherein said fragment memory manager is further operable 

fragment; t0 append said new fragment to said fragment stack and to 

b2) setting an occlude status bit for said fragment stack; write said fragment stack including said new fragment to 

said fragment memory. 

ra)sendingonlyavisiblefragmentclosesttoaviewplane u ^ anti „ aliasiDg buffer arch itecture of claim 25 

and said fragment from said fragment stack to said f urtri er comprising: 

computer graphics pipeline when said occlude status bit 40 - . . , , . ., 
is set* and a fragment evaluation pipeline manager coupled to said 
1 fragment memory manager and to said fragment evalu- 
h4) writing said visible fragment closest to said view ation pipeline, said fragment evaluation pipeline man- 
plane and said fragment back to said fragment memory. ager operab i e t0 rece i ve ^id fragment stack including 

18. The computer system of claim 17 wherein said status said new f ragmen t from said fragment memory man- 
bit is set in a status memory separate from said fragment 45 a ger and to send said fragment slack including said new 
memory. fragment to said fragment evaluation pipeline. 

19. An anti-aliasing buffer architecture in a computer 2 7. The anti-aliasing buffer architecture of claim 19 
graphics system, said anti-aliasing buffer architecture com- further comprising: 

P nsin S : a plurality of fragment evaluation pipelines executing in 

a fragment memory operable to store a plurality of parallel, wherein pixel colors determined from said 

fragment stacks, wherein a fragment stack is associated fragment evaluation pipelines are interleaved and 

with a pixel location; slore d in said frame buffer, 
a fragment evaluation pipeline coupled to said fragment 

memory, said fragment evaluation pipeline operable to * * * * * 
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